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Global spectrum fluctuations for the /^-Hermite and 
/^-Laguerre ensembles via matrix models 

loana Dumitriu and Alan Edelman 
February 7, 2008 

Abstract 

We study the global spectrum fluctuations for /3-Hermite and /9-Laguerre ensem- 
bles via the tridiagonal matrix models introduced in and prove that the fluctu- 
ations describe a Gaussian process on monomials. We extend our results to slightly 
larger classes of random matrices. 

1 Introduction 

1.1 The Semicircle Law, deviations and fluctuations, numerically 

The most celebrated theorem of random matrix theory, the Wigner semicircle law |44( HH] , 
may be illustrated as in Figure ^ by histogramming the eigenvalues of a single random 
symmetric matrix using the simple MATLAB code (normalization omitted) 

A = randn(n); S = (A A')/sqrt(8 * n); a = hist(eig(S), [-1 : delta : l]); 
bar([-l : delta : l], a/(n * delta)) 



X 




Eigenvalue 



Figure 1: 25 bin-histogram of the eigenvalues of a 300 x 300 symmetric matrix from the 
real Gaussian distribution, versus the semicircle. 
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Every time we run this experiment, we obtain deviations from the semicircle. The 
difference from theory is readily explained computationally: 



• we use a finite matrix size n, while the semicircle law is a theorem about the n = oo 



• histograms bin eigenvalues into boxes of finite width, while the semicircle density is 
a continuous function. 

There can also be numerical error in the experiments due to finite precision computa- 
tions and truncation error, but in practice this does not appear to be significant. 

It is worth noting that the algorithm above is inefficient in two ways: first, it uses the 
full matrix A, rather than the equivalent tridiagonal matrix H/^ n of Table |21 (with /3 = 1), 
and it calculates the eigenvalues to obtain the histogram. For more on how to obtain the 
histogram plot efficiently and without calculating the eigenvalues, see Section |21 

To study the next order behavior in the law, for large n, we can subtract away the 
semicircle and multiply by n. The next order average behavior is what we call the deviation 
and it was first computed by Johansson |2j| to be 



This expression for the deviation is the /3 = 1 (corresponding to real matrices) instance 
of the more general P > case (which was also computed by Johansson, and will be 

explained in the next sections). The /3 > deviation contains a — 1^ multiplicative 

factor in front of the expression on the right of (pQ), which disappears for (3 = 1. 

One can see this deviation result as stating that as n — > oo the eigenvalues are decre- 
mented in the interior at a rate that is fastest at the center, pulling the eigenvalues toward 
the endpoints. 

Upon further examination of the next-order term, one can observe a phenomenon not 
appearing in the leading order term; there are fiuctuations around the mean. 

Each time we run a trial we can compute this fluctuation. For example, if we have 25 
bins, the random fluctuation vector v = (f i, . . . , ^25)"^ is the difference between the count 
in each bin and the number of eigenvalues predicted by the semicircle plus the deviation. 
The entries of v can vary quite wildly, but inner products with discretized smooth functions 
result in normal distributions in the continuous limit. Speciflcally, if fk is the vector of 
size 25 consisting of the evaluations of f{x) = on the centers of the bins, then the dot 
products f^v are heading towards Gaussians with covariances £'[(/^f )(/^w)]. 

Precise statements require the size of the matrix to go to infinity and the histogram 
to melt into a smooth density function so that the v vector becomes a Gaussian process. 

Denote by FLUCTU ATION{f{x)) the random quantity representing the limit ob- 
tained of f'^v where, as above, / is the vector of function values and v is the vector 
of histogram differences. For smooth functions, FLUCTU AT ION {f{x)) converges to a 
normal distribution; for example, in the limit as n ^ 00, 



limit; 



DEVIATION = -6-i{x) + -6i{x) - — 

4 4 ZTT 



\/ 1 — x"^ . 



(1) 




(2) 
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where 




, iffc = l mod 2, 
^(|)', ifA; = mod 2. 



For general /?, the right side of © gains a multiphcative factor of ^. 
The hmit of the entry (ki, ^2) of the covariance matrix becomes the covariance between 
FLUCTUATION{x^^) and FLUCTUATION{x''^) (see Theorem O with /? = !). 

1.2 /?-Hermite and /3-Laguerre ensembles 

This paper studies deviations and fluctuations in a wider context than real symmetric ma- 
trices with semicircular asymptotic density: We consider Hermite and Laguerre matrices 
with general parameter /3 > 0. For a great reference for these ensembles, see Forrester's 
upcoming book UHI ■ 

Classical finite random matrix theory considers the study of eigenvalue ensembles with 
joint density 



with w a scalar weight function on an interval /. This interval may be a subinterval of the 
real line, or the unit circle in the complex plane; other possibilities have been considered, 
too, and generalizations are easily conceived. A good reference for these formulae can be 
found in Mehta's book j^HJ- 

Some of the most studied eigenvalue ensembles have Hermite, Laguerre, and Jacobi 
weight functions on the real line, or uniform weight on the unit circle. In this paper 
we will be examining the ensembles with Hermite and Laguerre weights on the real line 
(respectively, half- line); see Tabled 

For more references on Gaussian ensembles, see |29| : for Wishart and MANOVA en- 
sembles, see [Sni; for Hermite, Laguerre, and Jacobi ensembles, see flK] , 

For three particular values of /?, namely 1, 2, and 4, these ensembles have been studied 
since the birth of the field, as the Gaussian real, complex, and quaternion ensembles 
(Hermite with /? = 1,2,4) of nuclear physics ( |44j . |45j . |14j . ^). Similarly, the Wishart 
real and complex (Laguerre with /? = 1, 2 and some restrictions on the Laguerre parameter) 
matrices emerged from the world of statistical multivariate analysis (UHl, HOI) 12411. 

The parameter /3 (making the connection to the Boltzmann factor of statistical physics) 
is seen by some communities (e.g. statistical mechanics) as an inverse temperature, or 
repulsion strength, of the ensemble of eigenvalues (the higher the /3, the more separated 
the eigenvalues) . It also has the advantage of the easy mnemonic of 1 , 2 and 4 corre- 
sponding to real, complex, and quaternion entries in the matrix models. However, some 
communities (like algebraic combinatorics) consider a different parameter, a = 2//3, which 
tends to simplify certain formulas (|27j, |36i)- In this paper we will use both notations, 
for convenience, and make sure that the reader is informed when changes take place. 

The reason for the attractivity and success that the study of Gaussian orthogonal, 
unitary, and symplectic (Hermite with /3 = 1,2,4), and the Wishart real and complex 
(Laguerre with /3 = 1,2), ensembles have enjoyed lies in the existence of matrix models 
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Table 1: Random matrix ensembles with eigenvalue distribution proportional to 
/(xi, . . . , Xn) oc YYi^j \xi — Xj\^ W w{xi) with w defined on an interval / CM. MANOVA 
stands for Multivariate ANalisys Of VAriance. 



Name 


parameters 


/ 


w 


Historical 
name/ constraints 


Hermite 


/3 > 


R 


w{x) = e~^^/^ 


Gaussian 
/3 = 1,2,4 


Laguerre 


/3 > 

a > (n - 1)1 


[0,oo) 


w{x) = x^e~^l'^ 
p = a - (n - l)f - 1 


Wishart 
13=1,2 
a = m^, m G N 


Jacobi 


/3 > 

01,02 > (n - l)f 


[0,1] 


w{x) = xP^{l-x)P^ 
Pi = ai- {n- 1)§ - 1 
P2 = a2- {n- 1)1 - 1 


MANOVA 
P=l,2 
ai = mi^, mi G N 
0-2 = 'm2§, m2 G N 



with real, complex, and quaternion entries (see [^H], ISO])- These models have not only 
originated the study, but have also allowed for a relatively thorough analysis of the finite 
and asymptotical eigenstatistics (statistical properties of the eigenvalues). 

One of the developments in the study of arbitrary /3-Hermite, -Laguerre, and -Jacobi 
ensembles is the introduction of general real matrix models (see |11| . |25| . |3H])- For every 
P, there are simple real tridiagonal matrices which model the corresponding eigenvalue 
distributions given by Table ^ For the /3-Hermite and -Laguerre ensembles, we present 
these forms in Table [2 These matrix forms allow for efficient Monte Carlo experiments 
and an alternative representation for the study of eigenstatistics in the general /3 case. 

This paper contains one example of such analysis. With the help these matrix models, 
we compute asymptotical global spectrum fluctuations for the /3-Hermite and -Laguerre 
ensembles. For the latter, these results are new in the general f3 context; the global 
spectrum fluctuations of complex Wishart matrices (/3-Laguerre with (3 = 2) were studied 
by Speicher et al. and before by Cabanal-Duvillard Johansson's more extensive 
study covers our results for the /3-Hermite ensembles. 

To prove our theorems, we use a very diverse set of methods and techniques from 
Jack Polynomial theory, special functions, perturbation theory, and combinatorial path- 
counting. Some of the techniques we used in studying the traces of powers of random 
matrices have been inspired by the work of Soshnikov and Sinai [SSI, 1^3, and by the 
study of traces of unitary random matrices by Diaconis and Shahshahani [Hj . 

1.3 Statements of results 

Among the asymptotical eigenstatistics one may distinguish two classes of properties: lo- 
cal (like the scaled distributions and fluctuations of extremal eigenvalues, or like the level 
spacing distribution, i.e., the distance between neighboring eigenvalues) and global (like 
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Table 2: Unsealed and sealed tridiagonal matrix models for the /3-Hermite and /3- 
Laguerre ensembles with any /? > 0, n G N, a G M, and a > ^(n — 1). 



Hermite matrix, 
unsealed 


/ iV(0,2) \ 

X(n~l)f3 N{0,2) X(n-2)/3 

V XP N{0,2) J 


Hermite matrix, 
sealed 




Laguerre matrix, 
unsealed 


= ^U^KnY^ ^here 

X/3(n-l) X2a-/3 
V X/3 X2a-/3(n-l) / 


Laguerre matrix, 
sealed 


fa 1 ja 



the limiting level density, i.e. the distribution of a random eigenvalue, and fluctuations 
thereof). The term global tends to refer to a property that involves all or a significant 
portion of the eigenvalues, while local tends to refer to a property that occurs near an in- 
dividual or a constant number of eigenvalues. Among the more famous local properties we 
enumerate the level spacings for the Gaussian ensembles (see [21], [SHI) and the extremal 
(largest, corresponding to "soft edge", smallest, to "hard edge") eigenvalue asymptotics 
(see jini, |22I) |2n])' ah these results relate to real, complex, or quaternion ma- 
trices; in the category of results relating to general /3, we have to mention recent work 
by Desrosiers and Forrester [Hj, where they analyze the asymptotical corrections to the 
eigenvalue density, and, for /3 E 2N, obtain the expected 0(n^/^) order in the fluctuation 
of the largest eigenvalue in the case of both /3-Hermite and -Laguerre ensembles. 

For n (the size of the ensembles) finite, the n eigenvalues may be considered as n 
fluctuating particles. Roughly speaking, the larger the /3, the less fiuctuation there is in 
the particles (hence /3 is seen as an inverse temperature). As /3 — > oo the particle positions 
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behave like multivariate normals with variance 0(l//3) and means located at the roots of 
a Hermite (respectively Laguerre) polynomial (see |12j). 

For a fixed /?, as n ^ oo and, in the case of Laguerre ensembles, 2a/(n/3) — > 7, the 
particles have an emerging (global) level density which is obeys a simple law (Wigner's 
semicircle law |45j for the Hermite ensembles, Marcenko-Pastur laws 28 for the Laguerre 
ensembles). The roots of the Hermite, respectively Laguerre, polynomial have this same 
asymptotical density - the fluctuations do not change the asymptotics, as they are on a 
smaller scale. 

In jni) we have proved convergence almost surely (as n — 00) of the asymptotical 
eigenvalue distribution of the /3-Hermite ensemble to the semicircle distribution S with 
density ^Vl — x'^, and of the asymptotical eigenvalue distribution of the /3, a-Laguerre 
ensemble (with nf3/{2a) ^ 7 < 1) to the Marcenko-Pastur E^/ distribution with density 

(v^ i)J((v^+i) -^Yg recall that convergence almost surely is stronger (and 

implies) convergence in distribution, a.k.a. convergence of moments. 

n 

We examine here the distribution of the statistic ^ fiK), where / is a function of 

i=l 

the scaled eigenvalues Aj. The scaling is A ^ ^/2ri/3X for the /3-Hermite ensembles and 
A — > nP/'jX for the /?-Laguerre ensembles (see Table EJ. 

General /? results for this kind of statistic can be found in [H] , |5J ; this or similar linear 
statistics have been considered also in |15j (for unitary matrices), [HSl (for Wishart ma- 
trices), and, heuristically, in Another path of interest is represented by asymptotical 
large deviations from the density (spectral measure); we mention the results of [Ij, [2j, 
[T8) (which also covers global fluctuations for Wishart matrices) , |19| . [7] (which covers 
moderate deviations). 

For the linear statistic Y17=i /(-^«)> the Wigner and Marcenko-Pastur laws for the 
/3-Hermite and /?-Laguerre ensembles state that for any "well-behaved" function /, 



^ " 2 

- y^/(Ai) — /(t)\/l — t"^ dt , respectively, 



(3) 



n ^ 2^7y(^_i)2 t 

where the convergence in the above is almost surely^. 

Examining the fluctuations from these laws takes us one step further. For / a polyno- 
mial, we prove that once we subtract the expected average over the limiting level density 
(i.e. the right hand sides of © and @), the rescaled resulting quantity tends asymptot- 
ically to a normal distribution with mean and variance depending on /. In other words, 
once the semicircle or Marcenko-Pastur distributions are subtracted, the fluctuations in 

n 

the statistic ^ /(Aj) tend asymptotically to a Gaussian process on polynomials /. 

2 = 1 

How much more we can extend the class of functions that this process is well-defined 
on depends on the entries of the covariance matrix C = {Cij} (which can be expressed 
in any polynomial basis). One would be tempted to believe that a Gaussian process W 

^Such a result is sometimes given the name of "Strong Law of Large Numbers"; if convergence is only 
in distribution, the name becomes "Law of Large Numbers" . 
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defined on polynomials could, in principle, be extended to a class of continuous functions 
h{x) with the property that, given a sequence of polynomials Pnix) h{x) in some norm, 
given Vn = Varw{pn), Vn ^ v < oo, such that v = Varw{h). 

Definition 1.1. Let 7 € [0, 1] be a real parameter, and let a = (^/7 — 1)^, b = (^/7 + 1)^. 
We define the following two measures: 

^^(^) n^i(-)+i^-i(-)-2^7T^' 

I , otherwise. ' 

, otherwise . 

Theorem 1.2. Let Hp^n be a scaled matrix from the [3-Hermite ensemble of size n, with 
(scaled) eigenvalues (Ai, . . . , A^), and let k > 1 be a positive integer. For all i = 1, . . . , k, 
let 

Xi = triiHp^nT) - n ^.^^ % ^od 2),o " (| " ^) / ^ t'finit) dt, 

= XJa;- - n ;^y"^ tWl-t^ dt - - 1^ ffinit) dt . 
Let (Yi,Y2, . . . , Ifc) be a centered multivariate Gaussian with covariance matrix 



Cov{Yi,Yj) 



Then, as n —>■ 00, 



i^mDif) , ^f ^ = J=0 mod2; (5) 



2 

, otherwise. 



{Xi,X2,...,Xk)^ ^ {Yi,Y2,...,Yk) . 

Remark 1.3. For any size k, if we add a first row and a first column of zeros to the 
covariance matrix ©, the resulting (k + 1) x (k + 1) matrix has a scaled Cholesky 
decomposition as = T}^Dj^T^ , where is the diagonal matrix having on the diagonal 
the vector (0, 1, 2, 3, ... , k), and Tk has an interpretation as the change-of-base matrix in 
the space of univariate polynomials from monomials basis to the Chebyshev polynomials 
basis. One can also look at the infinite version Coo = T^oDaoT^. 

Remark 1.4. This Gaussian process is extended to a larger class of continuous functions 
in 121}/ . Johansson conjectured that the regularity conditions imposed were purely technical, 
and that in fact the correct condition should be that the function h admits a Fourier-like 
expansion in the Ghebyshev basis, which we write as an (infinite) vector h, such that 
JrDooh, the variance of h under the process, is finite. This conjecture is equivalent to 
saying that the process could be extended to any function f such that f^Coof is finite 
(since we have chosen the monomial basis as the representation, rather than the Chebyshev 
polynomial basis). 
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Theorem 1.5. Let L'^^ he a scaled matrix from the (3-Laguerre ensemble of parameter 
a and size n, with (scaled) eigenvalues (Ai, . . . , A„), and let k > 1 be a positive integer. 
Assume that nj3/{2a) ^ 7 < 1, and let "ymin = {^/y — 1)^) Tmax = {^/y + 1)^- For all 



-1 



= - ^ ty{t--lmin){nmax-t) dt - ho " 1 / ^VlW 

Let {Yi,Y2, . . . , Ife) he a centered multivariate Gaussian with covariance matrix 

Cov{Yi, Yj) = TD{i,j) + Ts{i,j) , (6) 



where 



rs 



r I \s 



and 



q=l ^ l=q+l U-1 ; ^ + s = l 

1 < r < i 
l<s<j 



r / V s 



Then, as n ^ oo, 



< r < z - 1 
< s < j - 1 



{Xi,X2,...,Xk) jI {Yi,Y2, . . . ,Yk) . 



Remark 1.6. For any size k, if we add a first row and a first column of zeros to the 
covariance matrix the resulting matrix C should admit a scaled Cholesky decomposi- 
tion of the form = Tj^D^T^ , with being the diagonal matrix with diagonal entries 
{i7~"'}o<i<fc; o.'f^d Tfc being the change-of-basis matrix in the space of polynomials from 
monomial basis to the shifted Chebyshev polynomials of the first kind (as defined by 
Cabanal-Duvillard ^ and used in \2bl/). Note that the constant c in TW^ is our I/7. It 
may also be useful to look at Coo = TooDooT^^- 

Remark 1.7. It is worth noting that results like Theorems and \1.5l where one aver- 
ages over a set of quantities, then subtracts the mean and scales by the variance to obtain 
a limiting Gaussian, are sometimes called central limit theorems (see for example 
and Free Probability uses this term as well, see for example \SS> ^'^ different 

context, namely, to express the fact that averaging over random matrices creates an eigen- 
value distribution that approaches the semi-circular law. Both uses of the term "central 
limit theorem" draw different parallels to the classical case. 
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Our approach to proving Theorems 11.21 and 11.51 consists of computing the first-order 
deviation from the mean (in Section IS} , showing that the centered process is Gaussian 
on monomials (by the method of moments), and computing the covariance matrices (in 
Section 13). 

Finally, in Section |1J we generalize our approach to two different classes of random 
matrices. 



2 Deviation from the semicircle and Marcenko-Pastur laws 

2.1 Dependence on (3: symmetric functions and the "palindrome" effect 

As stated in the introduction, we are interested in computing the deviation to the semicir- 
cle and Marcenko-Pastur laws (denoted below by L AW {co), as opposed to the LAW{n), 
which is the level density for finite n). These deviations have the form 

LAW(n) ~ LAW(oo) + - f 1 - | ) DEVIATION(p = oo) + o ( - 

n \ p J \n 

as n ^ oo. 

By integrating the above against x^, we can write this in the moment form 
momentk{n) = moment (oo) -\ — ^1 — — ^ momentk{DEVIATION{f5 = oo))-|-o > 
again as n — > oo. 

We mention two interesting points, the first of which we prove in this section: 

1. The factor ^ — 1 can be obtained from a symmetry principle alone. It is a direct 
consequence of Jack Polynomial theory that the coefficient of in momentk{n) 
is a palindromic polynomial (we define "palindromic" below) in — ^, and from the 
tridiagonal matrix models it follows that the degree of this polynomial is j; thus 
when j = 1 this polynomial must be a multiple of ^ — 1. Mathematically, this is 
significant because because in order to study the deviations, it is sufficient to then 
study the non-random case, (3 = oo. In summary, the powerful Jack Polynomial 
theory allows us to take a complicated random matrix problem and reduce it to an 
exercise on the properties of univariate Hermite and Laguerre polynomials. 

2. With the Maple Library MOPs we can compute symbolically the exact values 
of momentk{n) for small values of fc, as a function of n and (3. In other words, 
while this paper concerns itself with the constant and 0{\/n) behavior, it is worth 
remembering that higher order terms are in principle available to us. 

To make notation a bit clearer, we have used the a = 2//? in the below; we also recall 
the scaled matrices Hp^n and L'^^, from Section ll.2l The following are the first three 
non-trivial moments for the traces of the scaled /3-Hermite and /3-Laguerre matrices with 
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a = n/3/(27) = n/{aj). We omit the n and a in the notation for reasons of space. 



1 4 2 5a — 5 Sa^ — 5a + 3 

- i?[tr(/?2/J] = + + 1^;^2 ' 

1 pr /H-6 M _ 5 11a -11 16a2-27a + 16 15a3 - 32a2 + 32a - 15 

- i?[tr(L2/a)] = 1, 

n ' n 

1 , 2x 37(7 + l)(a-l) 72(2a2-3a + 2) 

n ' n 

Note that the 0(1) terms in the above correspond to the second, fourth, and sixth 
moments of the semicircle, in the Hermite case, respectively, to the first, second, and third 
moments of the Marcenko-Pastur distributions in the Laguerre case; these are Catalan 
numbers (scaled down by powers of 4 because of the semicircle [—1,1] normalization), 
respectively Narayana polynomials in 7. 

The 0(l/n) terms, the moments of the deviation, are always multiplied by a — 1, 
while the other coefficients of the negative powers of n in the above are "palindromic 
polynomials" of (—a); we recall the definition below. 

Definition 2.1. A classical "palindromic polynomial" is defined by the fact that its list 
of coefficients, is the same whether read from beginning to end or from end to beginning. 

Remark 2.2. An odd-degree palindromic polynomial in x is a multiple of (x + 1). 

To prove that the dependence of the first-order term in the deviation is indeed a 
multiple of a — 1, i.e., of ^ — 1, we will use elements of Jack polynomial theory, and also 
a stronger form of a duality principle proved in U0|. 

We introduce below two notational conventions to be used throughout the rest of the 
paper. 

Definition 2.3. We denote by E^[P{xi, . . . ,Xs)], respectively E^^[P{xi, . . . ,Xs)], the ex- 
pectations of the polynomial P over the scaled 2/6-Hermite, respectively 2/6, a -Laguerre, 
ensembles of size s. 

We denote by £^[P{xi, . . . ,Xs)], respectively £^^[P{xi, ... ,Xs)], the expectations of 
the polynomial P over the unsealed 2/ 5-Hermite, respectively 2/6,a-Laguerre, ensembles 
of size s. 

Let R[ xi, . . . , Xn] be the space of symmetric polynomials in n variables (by symmetric 
we mean invariant under any permutation of the variables). A homogeneous basis for this 
vector space is a set of linearly independent, symmetric and homogeneous polynomials 
which generate M[a;i, . . . , One such basis is given by the power-sum functions, defined 
multiplicatively below. For reference, see 22 ^iiid |27j . 
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Definition 2.4. Let A = (Ai, A2, • • • , A„) denote an ordered partition (Ai > A2 > • • • > Xn)- 
We define the power sum functions by 



PA, 



E 



X 



J 



and 



PA1PA2 



■PXn 



The Jack polynomials J" constitute a parameter-dependent (the parameter being 
usually denoted by a) class of orthogonal multivariate polynomials; they are indexed by 
the powers of the highest-order term A (in lexicographical ordering). 

Throughout this section, we will think of the parameter a as a sort of inverse to /3 
(recall that we have denoted a = 2//3). 

The Jack polynomials allow for several equivalent definitions (up to certain normal- 
ization constraints). We will work here with definition 12.51 which arose in combinatorics. 
We follow Macdonald's book |77j . 

Definition 2.5. The Jack polynomials J" are orthogonal with respect to the inner product 
defined below on power-sum functions 



a 



1{X) 



l(X) 



where zx = Y\ CLi\i"'\ being the number of occurrences of i in A. In addition, the 

i=l 

coefficient of the lowest-order term in , which corresponds to the partition [l''^'] = 
(1,1,...,!) (of length \X\), ts \X\\. 

From now on, we will use the notations /„ for the vector of n ones and we will refer 
to the quantity J"{xi, . . . , Xn) / J" (In) as the normalized Jack polynomial. 

To prove our results, we will need the two lemmas below, the first of which is a stronger 
variant of the duality principle proved in (lOj as Theorem 8.5.3 (the proof is virtually the 
same as in jlUj and we will not repeat it here) . The second one is a rewrite of a particular 
case {za = and formula (4.14a)) of formula (4.36b) in [Hj. 

Lemma 2.6. Let k' denote the conjugate partition to n (obtained by transposing the rows 
and columns in the Young tableau). Then the following is true: 



E, 



H 



J°(X1, 



Lemma 2.7. The following identity is true: 



/a 



Jll^ivi, ■■■,ym) 



Jk. (^1; • • • ! ^n) 
J^{In) 



In particular, for a = 



J-(xi 



n 



n 



X 



a 



+ y 



n 



a 



+ y 
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We can now prove the two main results of this section. 
Theorem 2.8. For k an even integer, let 

k 

1 ^ 
j=0 

with p[i^^ being the power-sum corresponding to partition [k]. Then f{a,j) is an integer- 
coefficient polynomial ml/a of degree at most k/2 such that 

/(«,j) = (-ar^+^/(l/a,j) . 

Remark 2.9. When we scale the ensembles, we have to multiply the expectation by 
(2ra/3)~^/^ = (^)''^^, which means that 

k 
2 



1 1 

3=0 



The corollary below follows. 



Corollary 2.10. It follows that g{a,j) := af'/'^ f{Q.,j) is an integer-coefficient polynomial 
of a for which 

9(.a,j) = {-ayg{-,j) , 
a 

so the degree of g{a,j) is at most j. This yields, in particular, g{a, 1) = Cfe(a — 1), with 
Cfc a constant depending on k. 

Similarly, for the Laguerre ensembles, we have the following theorem. 

Theorem 2.11. Let a = —, and 

k+1 j-l 



j=l r=Q ^ 

with being the power-sum corresponding to partition [k] . Then f{a, j, r) is a polynomial 
in 1/ a of degree at most k such that 

f{a,j,r) = {-ar''-^+'f{l/a,j,r) . 

Remark 2.12. When we scale the ensembles, we have to multiply the expectation by 
i^np) ~ (in)'^' which means that 

j=l r=0 

The corollary below follows. 
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Corollary 2.13. It follows that g{a,j,r) = f{a, j,r) is an integer- coefficient polyno- 
mial of a for which 

g{a,j,r) = {-a)^-^+^g{l/a, j,r) , 

so that the degree of g{a, j,r) is at most k — j + 1. This yields, in particular, g(a,k,r) = 
Ck,r{a - 1). 

Proof of Theorem 12.^ Note that 1/a = 2/(3 and that P[k]{xi, . . . ,Xn) =tr(X'^), for any 
matrix X with eigenvalues xi, . . . By using the unsealed matrix model i^/3,n for the 
/3-Hermite ensembles found in Table |21 one can obtain, as in Application 3 of (more 
precisely, from Corollary 4.3), that £^\p^f,^{xi, . . . , Xn)] is an integer-coefficient polynomial 
in 1/a of degree k/2, and a polynomial in n of degree k/2-\- 1. Hence f{a,j) is an integer- 
coefficient polynomial in 1/a of degree at most k/2. 
Let us now express pj^j in Jack polynomial basis: 

Ahfc 

omitting the variables for simplicity. 

Let Q(q!) be the field of all rational functions of a with rational coefficients. 

Let A X Q{a) be the vector space of all symmetric polynomials of bounded degree with 
coefficients in Q(a). 

For every ^ 6 £ Q(a), define the Q(a)-algebra automorphism a;^ : A x Q(a) — > A x 
Q(a) by the condition coglpf^) = {—l)^~^0pk, for all k > 1. This family of automorphisms 
appears in [27\ Chapter 10], and similarly in [S^. In particular, u; = wi is known as the 
Macdonald involution (and can be found in [271 Chapter 1]). 

We will use the following formula due to Stanley [HE] which can also be found as 
formula (10.24) in [27|: 

a;„J: = al^ljy-, (8) 

where A' is the conjugate partition of A (obtained from A by transposing rows and columns 
in the Young tableau). 

The first step of the proof is given by the following lemma. 

Lemma 2.14. The coefficients in equation ((7| satisfy 

cx{a) = (-a)i-'^cv(l/a) • 

Proof. We apply coa to both sides of 0, and use the fact that is linear, together with 
Q, to obtain that, on one hand, 

^aP[k] = (-l)''"^aP[fc] = (-1)^"^" J^CA(l/a)J;I^" , 

Xhk 

and 

Ahfe 
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on the other hand. 

Since there is a unique way of writing uiaP[k] in Jack polynomial basis, it follows that 

CA(a) = (-Q)i"'=cv(l/a) . 



□ 



Remark 2.15. Note that Lemma \2. 14\ does not say anything about expectations. 

We now write the expectation of over the unsealed Hermite ensemble using (jT)): 



£a [P[k]{xi, ■■■,Xn)] = ^CA(a) J"(/„) £, 



We know (for example from |^ page]) that J^{In) = Yli^x y)£x(^ — x + ay); hence 

J\ {^1 5 • • • ) ^n) 



[Plk] {xi,...,Xn)] =J2^^ 



Ahfc 



Yl (n-x + ay) . (9) 



By Lemma l'2. 61 £^ 



J^{xi,...,Xn) 



does not depend on n. Write 



JJ (n-x + ay) ='^bx{j,a) 



n-' 



we have 



and consequently 



(n-x + ay) = J| {n + a{y )) , 

ix,y)ex {y,x)eX' 



(10) 

Using the above, Q, Lemma l2.14( Lemma l2.6| and substituting | + 1 — j for j (in 
the power index of n), we obtain the statement of Theorem 12.81 □ 

Proof of Theorem \2.11\ The fact that f{a,j,r) is a polynomial in 1/a of degree k follows 
similarly to Corollary 4.3 in Application 3 of 11^. 
We write 



Sa,a[P[k]ixi,...,Xn)] = ^CA(a)£:^^„ 



Xhk 



/a (^1) • • • ) ^n) 



JJ (n-x + ay) , (11) 



(x,y)eX 



E-A(a) n + n (n-x + ay). (12) 



Xhk {x,y)eX 



{x,y)eX 



If we write 



'7 



it is not hard to see that 

hU,a) = by UA/ a) ■ (13) 

Using ()1U() , H13|) , and Lemma 12.141 we obtain the statement of Theorem 12.111 □ 
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2.2 Computing the /5-independent part of the deviation 

In this section we will examine the deviation at /3 = oo for the Hermite and Laguerre 
ensembles. In proving this we employ a simple differential equations trick that will allow 
us to compute the zero and first order terms in the mean of the eigenvalue distributions 
at /3 = oo. 

Given a function y{x) which satisfies the second order homogeneous differential equa- 
tion 

f{x)y"{x) + g{x)y{x) + h{x)y{x) = , 
denote by m{x) the function m{x) = y'{x)/y{x) (with poles at the zeroes of y{x)). 

Proposition 2.16. The function m{x) satisfies the first- order differential algebraic equa- 
tion 

(^) + + + m [x) 



. 



The proof is immediate. 



Remark 2.17. // the function y{x) is a polynomial with a finite number k of distinct 
roots, m{x) is the generating function for the powers of y 's roots. 

Let -ff/3,n and Lp ^ denote matrices from the /3-Hermite, respectively, /3-Laguerre en- 
sembles. As in |10j . to obtain the deviation, we once again will examine the averaged 
traces of powers of the matrices Hj^ n and L'^^, this time looking at the first-order terms. 

As a consequence of Corollaries I2.1UI and 12.131 for k an even positive integer in the 
Hermite case, and k an arbitrary positive integer in the Laguerre case with parameter 
a = (n/3)/(27), 



1 



n 



2 



1 



+ 0{n 



n 



while 



1 



E'^p'" [p[k]{xi,...,Xn)\ = Cfc(7) +4(7) 



/3 



1 



+ 0(n 



-2\ 



n ^ n 

It follows that the deviation is given by the moments c\, respectively, 4,(7), times the 
scaling factor — \. Equivalently, if we examine the generating functions 



m(n, /3, x) 



1 



n 



1 



A,; 



n 



k=0 



E 

k=0 



1 



Ck _^ 



0] 



r-fc+1 



+ ... 
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and 



m{n, a, (3, x) 



n 



E 



1 



X 



n ^ 



1 X\ 



0] 



fc=0 



E 

fc=0 



rk+1 



+ 



n 



2 



4(7) 



+ ... 



then in order to find the first order asymptotics of m(n, /3, x) and m(n, a, /5, j;), it is enough 



4(7) 



to compute respectively, 

k=0 k=0 

We will do this by keeping n fixed, letting /? — > cxd, and computing the zero and first 
order terms in - in the generating function of the resulting matrix ensemble. 



Remark 2.18. Knowing that the quantities [p[k]{xij • • • > ^n)] CLnd ^E^'"" [p[k]{xi, . . . , 

are polynomial expressions in ^ and ^ (which follows immediately by rescaling the results 
of Theorems \2.8\ and \2.ll\) is the key factor in allowing us to let (3 ^ oo while keeping n 
fixed. 

To aid us in our calculations, we will make use of a beautiful property of the Xr 
distribution, which will allow us to replace our tridiagonal models with even simpler ones. 
We give this property Proposition. 

Proposition 2.19. Let {Xn}n£N be a set of random variables with distributions Xrn, 
n £ N, such that r^ ^ oo as n ^ oo. Then the sequence {Xn — y^jneN converges in 
distributions to a centered normal of variance 1/2. 

It follows then that, as we have already shown in that given n and 7 < 1 fixed, the 



entries of the scaled random matrices H/^ n and L'pn~ ^0 ni-^p n) (with a = {njS) / (27)) 

\ 



converge as /5 ^ 00 to the following models: 



/3,n 



respectively to 



H 



2^ 







n 



n 











/ 



(14) 



7 ^ 



7 



1 aA 



^ - 1 0r 



(15) 



-2 V2 



-n + 2 V2 



^ -n + 1 

7 



-n + 2 
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Note also that if we denote by i?^ „ the matrix such that = B'^ ^{B'^ ^)^, then 



Bp,n 



B^ 



n _ 1 

7 



and that = B^B'^. 



V2 



n + 2 



7 



Since H, Lj, and are non-random matrices, all expectations are exact. 



(16) 



2.2.1 P = oo, Hermite case 

The matrix H (see (|14j) ) has as eigenvalues /ii/\/2n, . . . , hn/V^, where hi, . . . ,hn are the 
roots of the nth Hermite polynomial Hn{x) (this can be easily deduced from the three- 
term recurrence for the Hermite polynomials, see for example 02] )• For a more detailed 
description of the properties of this matrix, see |12j . 

It follows that the generating function we need to compute is 



1 " 1 
rh(n,x) = —} r- 



1=1 



2n 



We use the well-known identity 



n ^ 

5^ T 



X — Xi p{x) 



(17) 



where Xi are distinct values, and p{x) is the polynomial whose roots the XiS are, to obtain 
that 

H'^{xV2^) 

and by applying Proposition I2.16L we get that m{n,x) satisfies the differential equation 



/ / \\2 A /■ \ A (n, X) 

[m{n, X)) — 4xm[n, x) + 4 -\ = 



n 



(18) 



Writing m{n,x) = mo(x) — ^'mi{x) + 0{n ^), we obtain from H18|) that 

mo{x) = 2 (^x — \/ x"^ — 



mi{x) 



(^x — \Jx'^ — 1^ 
2(x2 - 1) 
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Computing the inverse Cauchy transform for mo{x) and mi{x) yields the semicircle 
distribution and, respectively 

=n*'''<"' 'f-^i-i'ii- (19) 

I , otherwise. 
We have thus proved the following result. 
Lemma 2.20. For any polynomial P, 

-1 /n \ rl 



n / P(x)s(x)dx — > It; — 1) / P(x)nH{x)dx 
J-i V/3 / 7-1 



as n —> oo. 



Remark 2.21. a side note, in the computation above we have provided yet another 
way to obtain the semicircle law for all (3. 

2.2.2 /? = oo, Laguerre case 

The matrix has as eigenvalues 7/1 /n, . . . , 'jln/n, where li, . . . ,ln are the roots of the 

rath Laguerre polynomial Ln^^^'^ (see (this can be easily deduced from one of 

the many recurrences for Laguerre polynomials, found for example as (26) in [IS])- To 
get a more detailed description of the properties of this matrix, refer to J2j, substituting 
n(l/7) — 1 for 7. 

Since we need to rescale the eigenvalues by an additional n, it follows that the quantity 
of interest is 

1 " 1 
i(n,7,x) = -Yl ^ ' 



i=i 



Once again we use identity El and obtain that 

m(n, 7, x) 



^n(l/7-l)V (^n 



n r"(l/7-l) 



xn 



and by applying Proposition l2.16( we get that m(n, 7, x) satisfies the algebraic differential 
equation 

7 (m(n, 7, x)f - rh{n, ^, x) ( 1 - ^ + 1) + ^ + 7^ [n,^,x) + ^ ^ ^ ^ 

\ X x J x n 

Writing rh{n,'j,x) = m(j{x) — -mi(x) + 0(n~^), we obtain from H2U|) that 



x + j-l-^jx-i^- iy)ix - (^ + 1)^) 

mo(x = 

27X 

^ - 7 - 1 - - (V7 - 1)')(^ - (^/7 + 1)') 



mi{x) 



2{x-{^-iy){x-{^ + iy) 
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By calculating the inverse Cauchy transforms of mo(x) and mi(x), one obtains the 
Marcenko-Pastur distribution, respectively 



t.y{x) 



l6,{x) + l6a{x) - ^-7=i_ , if E [a, b] , 
, otherwise , 



(21) 



witha = (V7-l)', ^ = (^+1)'- 

We have thus proved the following result. 

Lemma 2.22. For any polynomial P, 



E 



I3,a 



i=l 



n / P{x)e^{x)dx 



1 / P{x)ty{x)dx 



as n ^ oo. 



3 Fluctuation of the semicircle and Marcenko-Pastur laws 

In this section we compute the fluctuations terms for the /3-Hermite and /3-Laguerre en- 
sembles; we show that the fluctuation of the trace of any given power of the matrix 
corresponding to the ensemble tends to a Gaussian. 

The essence of the argument is simple. We will think of the random matrix as the 
sum between the non-random matrix of means (which we can also think about roughly 
as the P = oo non-random matrix), and a random matrix of the centered entries, and do 
some obvious computations of traces of powers. Much of the work goes into the technical 
carefulness to provide a complete argument, but the reader should not let this detract 
them from the simplicity of the idea, which is based on formula which gives, for a matrix 

tl(T ) = ^ ^ ijii2^i2«3 • • -^ikh ' 

with the proviso that the above sum is especially simple when the matrix T is tridiagonal. 

We provide here a heuristic explanation for the Hermite case, just for the purpose of 
emphasizing that the main idea is simple. 

One can think of the random matrix, for all practical purposes, as 

T K, Tp=oo H , 



where G is symmetric tridiagonal matrix of 0(l)-variance Gaussians (similar to the de- 
composition we used in [T^), with entries which are mutually independent, up to the 
symmetry condition. 

Then for any polynomial /i, 

tr(/i(T)) « tr(/i(r^=oo)) + ^tr(/i'(T^=oo)G); 

the second term is clearly normally distributed, and all we have to do is compute the 
variance and show it is finite (which we can achieve by examining the cases when /i is a 
monomial) . 
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It is worth noting that, in principle, one should be able to do this computation for 
continuously differentiable functions h with some additional conditions imposed by the 
fact that the variance needs to be finite. 

The technicalities arise because n — > oo and the equalities above are just approxima- 
tions, but this should not detract from the main idea. As we will see, using the method of 
moments will show that we do not need G's entries to be Gaussian (or even approximately 
Gaussian) in order for the fluctuation to monomials h to be Gaussian. 



3.1 The /3-Hermite case 

We write the scaled matrix H^ n as 

= A + -^Y , (22) 
\/np 

where A = Ej^[Hf^^n\ is the symmetric tridiagonal matrix of mean entries {A{i,i + 1) = 
2^^ ^[X{n-i)f3]'i s-ll other entries are 0), and Y is symmetric tridiagonal matrix of centered 
variables (with a diagonal of independent Gaussians of variance 1/2). Technically, A and Y 
depend both on n and on /3; we drop these indices from notation for the sake of simplicity. 

Remark 3.1. From Provosition \2. lf\ we know that if A = {aij)i < i j < n' then for any 
e > 0, there exists G N such that 

(2y/ni3 ai^i+i - \/ {n - i)P^ < e for any i < {n - i^) , 

and that N{0, 1/8) in distribution as i ^ oo, while Yi^i ~ A^(0, 1/2) for all i. 

Remark 3.2. (bounded moments) Note that the entries of A are bounded, both from 
below and from above; we will think of them as 0{1). Similarly, for any k and I finite, we 
know from the above that there exists an M such that 

kl 

E([lY-^^<M, 

i=l 

for all < Cj < kl, and for all ji, ■ ■ ■ ,jki and j'l, ■ ■ ■ such that \ji — < 1. 
Given integers k and n, consider the random variable 

u;fc(n) = tr(F^_J-£;f[tr(#|^J] . 
Claim 3.2.1. For any fixed integers k and I, 

lim E[{uJk{n)y 



(l) 5(feTT)T k^''^ {\)\ ifk' ^ even, 
(f) W ^^'^ (ti) 1 if k is odd and I is even , 



0, if I is odd. 



20 



Claim 3.2.2. For any fixed integers ki and k2, 

i& (llli) . ^fki, k2 are odd, 



lim Cov{uk^{n),u]k2{n)) 



I ^feTTEi+2 |^^(^)(|) , «/A;i,A;2 are even, 
, otherwise . 



Remark 3.3. Claims V3.2.1\ and \3.2.}l\ show that the centered fluctuation of the (3-Hermite 
ensembles describes the Gaussian process on monomials defined in Theorem M.lX 



For the remainder of tliis section, we will prove Claims 13.2.11 and 13.2.21 To give the 
reader a rough idea of where the calculations will lead, we provide below an intuition of 
what we will be doing. 

Intuitive explanation. The first step is to note that tr{T^) is a sum of products of k 
entries of T; for a tridiagonal matrix T = (tjj)i<ij<n with tij = if |i — j| > 1, 



tr(T ) — ^ ^ ^«l,«2^«2,«3 ■ ■ - ^i 



l<il,...,ifc<n 

where the sum needs to be taken only over the sequences ii, . . . ,ik such that \ij—ij^i\ < 1, 
for all J = 1, . . . , — 1, and also \ik — «i| < 1- 

We have a sliding "window" of size k down the diagonal of the matrix T in which we 
take products of powers of the elements. In particular, for the matrix A, this is easy to 
visualize, because with the exception of a finite bottom right corner, the entries of A in 
any finite window look roughly the same. 

The second step is to identify the significant terms, i.e. the terms that have non-zero 
asymptotical contributions. Roughly speaking, these will be the terms which will contain 
precisely one element of Y, and all the others from A. Nothing surprising here, as Y is 
scaled by (see (jUJ). 

Finally, we will compute the contribution from the significant terms and show it agrees 
with the result of Claim IT.2. II Then we will note that the same reasoning yields the result 
of Claim Em 

We now proceed to make the above intuitive description rigorous. We need to introduce 
some notation. 

Definition 3.4. For given n and p, we denote by Sn,p C {1, . . . ,n}P the set of sequences 
of integers ii, . . . ,ip such that {ii, . . . ,ip) G {1, . . . , n}^ and \ij — ij+i\ < 1 for all j = 
1, . . . ,p — 1, and also \ip — ii\ < 1. 

We denote by I an element of Sn,p, and we denote by 

'■= iii,i2^*2,«3 ■ • • ^ip,h ' 

where (ii, • • • > ip) ='■ I- 

For a given T £ Sn,p, note that we can break up the sequence (ii, . . . , ip) into concate- 
nations of sequences 

J = ((Vo; • • • ) ^Pl)) (V2' • • • ) ^Pa)' (V4 • • • Vs)' • • • ! (^P29' • • • ) V2q+l)) ' 
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and 

R = {{ipi ) • • • ) (Vs' • • • ) ^^4)1 • • • ) ) • • • 5 V29)) 1 

such that in each of the sequences ^2*;' ^2*:+!' • • • ' ^P2fc+i (^^^ = 0, . . . , which form J, 
consecutive indices differ by exactly 1, and in addition to this 

(ii , . . . , ip) = (ipo , • • • , ipi , ipi+i ) • • • ) V2 1 V2+1 1 • • • ) 1 • • • ) V213+1 ) • 

We aUow for the possibihty of having empty sequences ip^, . . . , ip^ in the beginning 
and/or ip^^^ . . . , ip2,+i in the end of J. 

Remark 3.5. To give an intuition for these sequences, note that any term in tr{{Hj3^n)'^) 
can be thought of in terms of products of entries from A and entries from Y ; the sequences 
J and R will he overlapping "runs" recording the former, respectively the latter. 

Also note that, given a fixed I, each J satisfying the requirements above has exactly 
one R = R{J) corresponding to it, and that to different Rs correspond different Js. Fur- 
thermore, since k is finite, a given concatenation of sequences R may corresponds only to 
a finite number of sequences X (since all indices must be within k of each other!). 

Definition 3.6. We define the set J as the set of pairs (J, i?) described above. For a 
tridiagonal matrix T , we define 



P3 ^'^P3 



. . . ti. 



«2q,j2<j + l • • • ''j2<j+l — l,j2<j+l 



similarly. 



For any sequence I £ Sn,p, we can write 



2g-l,«29-l+l • • • ^i2q~l,i2q 



(23) 



with P being the total length of the "runs" in the sequence R (i.e., P = {p2 — pi + 1) + 

(P4 - P3 + 1) + • ■ ■ + ip2q - P2q^l + l))- 

We have now enough information to start the proof of Claim 13.2.11 



Proof of Claim [^.2.1i First we examine 



Eiiukin))'] = E 



tr{HU-E^MM)] 



note that 

I 

E[Mn)y] =E[ Yl n {(^P,n)j, - i^[(i^/3,n)x, 

Il,...,Il&Sn,k i = l 

Using (|23jl . we write 



E[Mn)y] 



E 



1 < J <l 



.(24) 



22 



Since ^ is a non-random matrix, it follows that we can rewrite (|24|) as 



1j e Sn.k (Jj'Ri)^J'j V= 
1 < J <l 



l[{iY)R,-E[iY)n,]) 



(25) 



Denoting by q := Yli=i -fj/^i we obtain that 



E E 

1 < J < / 



E 



.(26) 



Lemma 3.7. The non-zero terms in ()26|) have q > 1/2. 

Proof. If any of the / terms in the product nj=i((^)^?i ~ -^[(^)rJ) involves only variables 
that are independent from all other variables appearing in the remaining / — 1 terms of 
the product, the expected value of the product is 0. This includes the case when at least 
of the RiS are empty. Hence, in all of the terms that have non-zero contribution to the 
expectation, each Ri must be nonempty, hence 2q = X]!=i Pi ^ I- D 

Remark 3.8. Note that in fact something stronger follows, namely, that for each 1 <i < 
I, there exists an 1 < j < n, j ^ i, such that the some variable appearing in {Y)ji. also 
appears in {Y)^. . 

Since the entries of ^ are 0(1), and k and / are fixed, the factors nj=i(^)^i 0(1). 
By Remark 13.81 and Lemma 13.71 it follows that every term with non-zero contribution in 
the double sum corresponds to an /-tuple {Ri, . . . , Ri) with the property described 
in Remark 13.81 Each such /-tuple comes with a weight proportional to 1/n'^. 

To compute the asymptotics of the sum, we will do the following thought experiment: 
select a non-zero contribution term and draw the "correlation" graph with Ri as vertices, 
and an edge between Ri and Rj if and only if {Y)ji. and {Y)ji- are correlated. The 
resulting graph will have s connected components, with 1 < s < [^J . 

Call these connected components Ci, . . . , C^, and consider the set of variables Vi, . . . ,Vs, 
such that f S Vi if and only if there is an Rj in Ci such that v appears in (Y)ji.. Select 
from each Vi a single variable; these variables will be independent. A variable corresponds 
to a choice of 1 index and 3 possibilities (since it will be of the form or li,i). 

If we were to choose a set of s independent variables from Y, roughly, to how many such 
/-tuples (i?i, . . . ,Ri) would this choice correspond, and in turn, to how many sequences 
X do these correspond? In other words, to how many non-zero contribution terms in the 
sum ()26() can a choice of s independent variables correspond? 

The answer is 0(1). 

Indeed, by the way we defined 2 and R, it follows that, once we have chosen a variable 
V € Vi, for all other variables in Vi we have a finite number of corresponding indices to 
choose from. Indeed, this happens because the correlation of RiS induces a "clustering" 
of variables (since all indices must be within \Vi\ x (A; -|- 1) < /(A; -|- 1) of each other). 
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Hence, for each of the possible 0{n^) choices of s "representative" variables, we have 
only 0(1) possible non-zero contribution terms in the sum (|26)) . 

Going backwards, it follows that for any s, there are 0{n^) terms for which the correla- 
tion graph has s components. Since each of these terms has weight at most l/n"? < 
we have proved the following lemma. 

Lemma 3.9. The contribution to the expectation sum (|26jl from all terms with s < 1/2 
or q > 1/2 is asymptotically negligible. 

Thus, the only terms of asymptotical significance are those for which s = q = 1/2. If I 
is odd, this immediately implies 

Lemma 3.10. With the notations above, for k and I fixed, I odd, 

lim E[L0ki7iy] = . 

ra— >oo 

Let us examine what happens when / is even and s = q = 1/2. Such terms are easy to 
understand: they correspond precisely to /-tuples . . . , Ri) for which \Ri\ = 1 for all 
i, and for each 1 <i <l there exists a unique 1 < j < I such that {Y)ji. = (Y)f{.. 

We make the following simple observation. 

Lemma 3.11. The number of diagonal terms Yjj contained in each {Y)ji., counting 
multiplicities, has to have the same parity as k. 

Proof. Indeed, by the definition of any R, all the diagonal terms found in {Hp^n)i must 
be found in (Y)r. The parity of these terms, counting multiplicities, has to be the same 
as the parity of k. This is easy to see; if T = (zi, • • • , ^fc), then by Definition 13.61 

n - 12 + ^2 - «3 + • • • + ik-i - ik + ik - h = , 

and since each difference ij — ij+i above is either 0, 1, or —1, it follows that the number 
of differences equal to has the same parity as k. The number of differences equal to is 
the number of diagonal terms. □ 

It then follows that, for all /-tuples of . . . , Ri) for which s = q = 1/2, 

• if A; is odd, all variables present in the (y)/j.s are diagonal variables, and 

• if A; is even, all variables present in the (Y)ji-s are off-diagonal variables. 

We summarize here what we now know about the terms we need to study when / is 
even. 

Lemma 3.12. The only asymptotically relevant terms have the property that there exist 
1/2 distinct indices ii, . . . , i//2 such that for each ij there exist precisely two values ji < j2 
for which 

1. if k is odd, Rj^ = Rj^ = {{ij,ij)}. 

2. if k is even, either one of these four possibilities: 

• Rji = Rj2 = + 1)}' or 
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• Rj^ = Rj., = {{ij + or 

• Rj^ = + 1)} and Rj^ = {{ij + or 

• Rj^ = {{ij + l,ij)} and Rj^ = + !)}• 

Note that in this case, = (y)ij,2; because the matrix is symmetric. 

We call all such terms significant. 

We will now need a stronger result than Remark 021 

Lemma 3.13. For any given e > and k,l £ N, with I even, there exists some i^ £ 

and the 



N such that for any significant term [Ylj=iiA)j^j E \[j=i{{Y)R^ - E[{Y)[i^]) 
corresponding l/2-tuplet {ii, . . . ,ii/2), if k < ii, . . . ,i;/2 l^n — i^, then 

• if k is odd. 



u=i 



l/^ / . \ k-l , 

n 1-^) i 

m=l ^ 



< e 




llmR^-E[(Y)R^]) 



1/2 

nil 

m=l 



n 



k~l 



2kl+l 



< e 



Proof. The lemma follows easily from Proposition 12. 19) and Remark 13. 11 together with the 
fact that if Ri contains the index j, then all indices present in Jj are within k oi j. □ 

We prove now that it is enough to look at the significant terms for which n — i^ > 
ill ■ ■ ■ ,ii/2 ^ ^ those covered by Lemma r3.13p . 

Lemma 3.14. The contribution of significant terms for which some ij > n — or ij < k 
is asymptotically negligible, i.e. o(l). 

Proof. Each contribution from a significant term 



\i=l 



llm.-E[{Y)n^]) 



is bounded by some constant M, by Lemmas 13.21 and l3.ll Since restricting a choice of ij 
to be greater than n — i^ or less than k yields a finite number of choices for that particular 
ij, and since j < I is finite, it follows that there are only 0(n'/^~^) such restricted terms. 
But since the contribution of any such term is weighed by l/n'^^, the statement of the 
lemma follows. □ 

So we have reduced the computation to examining the contribution from the terms for 
which n — i^ > 1^ . . . ,ii/2 ^ k. Assume w.l.o.g i^ > k (we can always choose a smaller e). 

Given an ordered l/2-tuplet of distinct indices n — i^ > 1, . . . , 2^/2 ^ ^) how many terms 
can correspond to them? First, there are (/ — 1)!! ways of pairing these indices to the RiS 
in this order. Second, once the pairing is given, 
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for k odd, such a term must have the corresponding Zj sequence be a sequence where 
all but one consecutive difference are ±1 (the one difference that is corresponds 
to the insertion of the diagonal term). There are ^^(ti)) such choices for each Tj, 

for a total of \k{\-\^ ) choices. 



for k even, such a term have the corresponding Ij sequence be a sequence where 
all consecutive difference are ±1, and one of these differences corresponding to the 
"marked" term that belongs to Rj. Taking into account all the 4 possible case, we 

obtain a total number of k'^{k) for each pair of matched 2jS, and total number of 



A;f fc) 1 ^ choices. 



Note that in either one of the two cases above, all choices of sequences are valid, because 
the indices in each sequence will stay between 1 and n (this is where we need that all 
k < ij < n — ie). 

Thus, the total number of significant terms which correspond to a given ordered 1/2- 
tuplet of distinct indices n — > 1^ . . . ,ii/2 > k ) is 



{I - m {k{t-i))' 



if k is odd, and 



• {I- 1)!! if A; is even. 

From Lemmas 13. 9| 13.121 l»'^-l<'^l and I.S.14| we obtain that for any given e, if k is odd, 



and so 



E 



1 



all significant terms 
with k < ij < n — ie, V j 



E 



ll{(Y)n^-E[iY)R^]) 



E[{uk{n)y]-{l-l)\l{k{\_l 



E 



1 



1/2 



n 1 



(n/3)V2 11 V ■ 4n 
all significant terms ^ 

with k < ij < n — if:, W j 
I 

n-ie>ii,.. .,ii/2 > k 
all ij distinct 



k-l 



< 



1 



e +0(1) , 



k-l 



(/-!)!!( A;( ) ) e(l + o(l))+o(l) 



e (1 + 0(1)) . 
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Since e was arbitrarily small, it follows that if we can compute 



all ij distinct 

we are done. But, since / and k are fixed, the sum in S is asymptotically the same as the 



value of the integral (4 Jq (^-4^)'^ ^ dx) , hence 



1/2 



1 TT / . « ) \ 1 11 



n-ie>ii,... ,ii/2 > k ■'"^ 



all ij distinct 

hence 

V2 1 , 



for arbitrarily small e. 

Similarly, for k even, we obtain through the same sort of calculation that 

for arbitrarily small e. 

Claim nT.2.1l is thus proved. □ 

Proof of Claim \3.2.iA The proof is based on the same idea as the proof of Claim 13.2.11 
the same reasoning applies to yield the asymptotical covariance result. □ 

Remark 3.15. Note that we never actually used the full power of the fact that the entries 
of the tridiagonal symmetric matrix Y tend to independent centered normal variables. 
We only used the following three properties: 

. E[Y,4 = E[Yi+i^i] = 0; 

• Var[Yi^i] = \, while lim„^oo Var\YiJ^i^i] = |; 

• for any k, there exists a number Mj^ > such that \E[(Yij)'']\ < Mj^, for all 1 < 
i,j ^ n (boundedness of moments). 

3.2 The /5-Laguerre case 

Given an integer k, consider the random variable 

%,^(n) = tr((L^,J^') - Ej^^^M{LlS)] ■ 
The main results of this section are given in the Claims below. 



27 



Claim 3.15.1. For any fixed integers k and I, 

1/2 



lim E^y.if^ 

n— »oo ' 



where 



[{rj^ (n))'] = I (l) {Sumi{k,-f) + Sum2ik,-f)y/^ {I - ly.l , if I is even , 
[ , if I is odd 



n ^ i-iy ^ fkVfk^' 

Sl + S2= j 
^ 1^ Si, S2 < k 

V J y + S2 = J 

< Si, S2 < fe— 1 

Claim 3.15.2. For any fixed integers k and I, 

2 

lim Cov{rji^^{n),r]j^^{n)) = -{Sumi{i,j,-/) + Sum2ii,j,-/)) , 

where 

l<r <i 

1 < s < i 

Q J ^ ^ J ' rp _|_ ^ J 

< r < i-1 
< s < j-1 

The method we employ for proving these claims is basically the same as in Section [3. 11 
the only things that change are the details of the sequences we will deal with, which in 
turn effect a change in the calculations, and yield the results of Claims Hi. 15. II and 13. 15.21 
In the following we will point out where definitions and calculations differ from before, 
but we will not go over the reduction arguments again, for the sake of brevity. 

We write the scaled matrix i?^ ^ as 

Bln = D + ^Z , 

where D = Eji^ ^[Bp ^] is the bidiagonal matrix of mean entries D{i, i) = ■^pE[x2a-ii3\ and 

D{i + l,i) = ■^=£'[x(n-i)/3]) and Z is the lower bidiagonal matrix of centered variables; 
we drop the dependence of D and Z on /3, a, and n, for simplicity. 



Sum2{k,j) 



<?=! J=<?+1 Vi-l' Si + S2=J 



2k-2 (2k\ 2k-2 , s 
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Remark 3.16. From Provosition \2.1iA if D = {dij)Kij<n, given any e > 0, there is an 
if G N such that 



\\/n/3/^ di^i - - < e, and 

I n/3/7 di+i^i - v^(n - i)P\ < e, 

for any i < n — if,. 

Here we also used the fact that 2a/ (nP) ~ I/7. 

Similarly, again from Proposition \2.1!]\ we know that Zi^i — > N(0,^/2) and — > 
A^(0,7/2) in distribution. 

As in Sectioning we start from the expression for tT{{BB^)^), for i? a lower bidiagonal 
matrix: 

tv[{BB ) )= ^n,i2^«2,i3 ■ • • ^j2fc-l,«2fe^i2fc,il ' 

l<il,i2,.--,«2fe<" 

where the smn is taken over sequences . . . ,i2fc) with the property that i2j-i — i2j £ 
{0,1}, for ah 1 < j < k, and Z2j — «2j+i £ {0,-1}, for ah 1 < j < /c — 1, and also 
i2k - ii G {0, -1}. 

Just as before, we will introduce a few notations (we "recycle" some of the notations 
we used before; note that the quantities change). 

Definition 3.17. We denote by Sn,k S {1, . . . , n}^'^' the set of sequences of integers 
ii, . . . ,i2k such that i2j-i — i2j £ {0, 1}, for all I < j < k, and i2j — i2j+i G {0, — 1}, 
for all 1 < j < k — 1, and also i2k — ii ^ {0, —1}. We denote by Z an element in Sn^k- 

For each such X, we consider all the ways in which we can "break up" I := (ii, . . . , i2k) 
into overlapping "runs" J and R, i.e. 



and 
with 



J — ((Vo' • • • 5 Vl)' (^2' • • • ) Va)' (^4 • • • Vs)' • • • ' (V2q' • • • ' V29+1)) ! 
R = ((^pi ) • • • ! V2)' (VS' • • • ' ^^4)) • • • ) (^P2g-l 5 • • • ) V2q)) ' 



(il, . . . , i2A;) — (*po ) • • • 5 Vl ' Vl+l' • • • ) ^P2 ' V2+I1 • • • ' ^P3' • • • ' ^P2<j+l ) • 

Note that this preserves the requirement that ij — ij+i G {0, (— l)"'^^-' ^"'^ ^} for all j. 

We allow for the possibility of having empty sequences ip^,. . . ,ip^ in the beginning 
and/or ip^^, . . . , ip2q+i ^• 

Definition 3.18. For any I, we introduce a set J of pairs {J,R), as described above. 
For a bidiagonal matrix B, we define 



{BB^h 


— ^n,j2^«2,«3 • 


■ ■ ^«2fc-l,«2fe^i2fc,«l ' 








(BB^h 




• bip^-l^ip^bip^bp2+l ■ ■ 




■ bi2q,i2q + l ■ ■ 


■ bi2q + l — l,i2q + l 


{BB^)r 


bip-^ yipl ■ ■ 


■ ^*P2~-'-'*P2 ^*P3*P3+-'- ■ ■ 


' ^^P4 ~^^^P4 ' ' 


■ bi2q-l,i2q-l- 


M • • • bi2q-l.i2q 



Remark 3.19. Note that any term in tr{{L^ will consists of terms in D and terms 
in Z , with a sequence of runs J recording the former, and a sequence of runs R recording 
the latter. 
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Proof of Claim W.15.SA As before, we note that 



D + ^z] (d + -^Z 



with P = P2-Pl + 1 + ...+P2q- P2q-1 + 1- 

Similarly with p6|l . write 



E 



X {J,R)eJ 



:{D)j{Z)r,{21) 



E E 



1 



inB)i , 

1<J<1 



\{{D)j^ E 



X{{{Z)i^-E[{Z)r^]) 



,(28) 



with (7 = ^. Pi/2. 

The rest of the argument follows in the footsteps of the proof of Claim 13.2.11 Just 
as before, it can be shown that the only terms with significant contribution are those for 
which, for each z, {Z)r^ consists of a single term, and, in addition to that, the set of Ris 
can be split in pairs {Ri^,Ri^) such that {Z)r^^ = {Z)ii^_^. This yields 

hm S[(r/,,^(n))2f+i] = . 

ra— >oo 

Also, if / is even, the argument that we can consider only the terms for which there is an 
Z/2-tuple (zi, . . . ,i//2) which "avoid" the upper and lower corners of the matrix (like in 
Lemmas 13. 131 and I3.14|) still applies. 

The one way in which this computation will differ from the one we made for the proof 
of Claim 1^.2.11 lies in the fact that approximating {D)j, given an index ii present in J, 
becomes a little trickier, since the diagonal and off-diagonal elements will approximate 



respectively to ^yl/7 - i and ^/y^l- ^• 

Thus, one more parameter will become important, namely, the number of off-diagonal 
terms in each {D)j-. Note that in each sequence 2 we must have an even number 2s of 
off-diagonal terms (either from D or from Z), since ii — 12 + . . . + i2k — H =0, and this 
also implies that we have an even number 2{k — s) of diagonal terms. 

1. Suppose we fix the term in Zr. to be the diagonal term -Zi^^ij ; to how many sequences 
I with a fixed number 2s of off-diagonal terms can this correspond? The answer 
is 2{k — s)(^) ; we have (^) ways of picking the off-diagonal terms (because of the 



alternating property), and once those are picked we have 2{k 
location of ^i^^ii among the diagonal terms remaining. 

Each such sequence will have asymptotical weight 



s) choices for the 



{D)j ~ 7 



fc-1/2 



1 



-1/2 



2. Suppose we now fix the term in Zr. to be the off-diagonal term Zj^+i^j^; to how 
many sequences I with a fixed number 2s of off-diagonal terms can this correspond? 
The answer is 2s (^) ; we have (^) ways of picking the off-diagonal terms (because 



30 



of the alternating property), and once those are picked we have 2s choices for the 
location of Zj^+i^j^ among them. 

Each such sequence will have asymptotical weight 

fc-s / . \ s-1/2 



Finally, using the binomial formula 

^1 s^l 



and after some processing and use of the Riemann-sum and Beta-function formula 

2k—r—s / , \ r+s—l ^1 



2k-r-s 



hm (-) = / {1-x 

{2k-r-s)\ (r + s-1)! 



(2fc)! 

combined with all the possible pairings of the RiS (which yields the necessary [I — 1)!!), 
we obtain the result of claim 13.15.11 □ 
Claim IS". 15.21 has a similar proof. 

Remark 3.20. As in Section Vj.lV we never actually use the full power of the fact that the 
entries of the bidiagonal matrix Z tend to independent centered normal variables. We 
only used the following three properties: 

. E[Zi^i] = E[Zi+i^i] = 0; 



limn^oo Var[Zi+i^i] = lim„^oo Var[Zi^i] 



1 ■ 

2' 



• for any k, there exists a number > such that \E[{Zij)^]\ < M^, for all 1 < 
*)j l^n (boundedness of moments). 

4 A more general setting 

We present here a way to generalize the "non-random matrix + small random fluctuation" 
decompositions we have used in Section |31 in order to analyze the deviation and fluctuation 
from the asymptotical law for the eigenvalue density in the case of the Hermite and 
Laguerre /3-ensembles. 

4.1 Tridiagonals 

Let / and g be two integrable functions /, : [0, 1] ^ M such that the integrals f°'{x)g^{x)d2 
exist for all a, 6 G N, and such that the quantities 
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are the moments of a (uniquely determined) distribution defined on a compact [a,b]. 
For any n G M, we then consider the n x n matrix Ft'. 



Ft 



( fil) am 

am fm am 
am ■•• 



V 



••• a{^ 



so the diagonal of Ft is an equidiscretization of / with step 1/n (with the exception of 
the endpoint at 0, which is missing), and the off-diagonal is an equidiscretization of g with 
step 1/n (missing both the endpoint at 1 and the one at 0). 
We consider the tridiagonal symmetric matrix 



Rt = 



^ -^n Un—l 

Vn-l Xn-1 yn-2 

yn-2 



V 



2/2 X2 2/1 

2/1 xi j 



with the variables Xi and being mutually independent and satisfying the following three 
properties: 

• E\x^ = E\yj\ = 0, for all 1 < i < n, 1 < j < n - 1, 

• Var[xi] = £7^, for all 1 < i < n, and Var[i/j] = r/^, for all 1 < j < n — 1, 

• for all k there exists a > such that |£;[(a;i)'=]| < and |£^[(2/j)*^]| < M^, for 
all 1 < i < n, 1 < J < — 1. 

Note that F^ is a non-random matrix, while Rt is a random one. 
Now we consider the matrix model 



Mr = Ft + —!=Rt ■ 



(30) 



We will compute the asymptotical eigenvalue distribution, the first-order deviation 
from it, and the first-order fiuctuation, for the random matrix Mt- 
We need two more definitions. 

Definition 4.1. Let P be a path on the lattice 1? , starting at (0,0) and ending at {k,0), 
with up {{x,y) — > (a; + l,y+l)), down {{x,y) {x + l,y — l)), and level {{x,y) {x+l,y)) 
steps. For each level j £ Z, we define the quantities aj{P) and bj{p), as follows: 



a,{P) 
bjiP) 



^of level steps from j to j; 
^of down steps from j to j 



1 . 



Note that, since the path P ends at {k,0), the number of up steps it takes must always 
equal the number of down steps it takes. 
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Also let 



'Pr,k 
Pr,k,i 



{paths from (0,0) to {k,0) with exactly r down steps}, 
{paths in Vr,k which descend to, but not below, y = —i}, 

\T^r,k,i I ) 

LfJ 



Theorem 4.2. Let Mj- be a matrix from the ensemble defined by (|30jl . of size n, with 
eigenvalues (Ai, . . . , A„), and let k > 1 be a positive integer. For all i = 1, . . . ,k, let 



r-l 



'k,j,r 



r=0 



'r-l 



1 

2 \r, r,i — 2r 
3 



+ 



+ /(0)-.(0)- 2Vr,M-2, 

\i=o 

- E E^'"^(^) f r^''-\t)g^'{t)nt)dt - 

PePr,i jez 

- 2 E E^'^^(^) f r''{t)9'''^\t)9'm + 



E E 

. Pen-,* iez 



)j £ f'-''~\t)g^''it)dt + 



26, (P) 



f'-'^{t)g'^-\t)dt . 



Also, for any I < i < k, let 



Xi = triiMTT) - n / x'p{x) dx - 



j=i •^'^ 

Xei (11,12) • • • ^Yk) be a centered multivariate Gaussian with covariance matrix 

"1 



Cov(Yi,Yj) = Yl (^ - 2r)(j - 2s) 
< r < [|J - 1 

< s < LiJ - 1 

+ T]"^ Y2 ^''•5 

1 < r < L§J 
1 < s < LiJ 



r,r,i — 2r J \s, s, j — 2s 



f^^'^''\x)g^'^''''\x) dx + 







r,r,i- 2rJ \s,sj - 2s J Jo 



/^("'^)(x)5^("'^)(x) dx 
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where A(r, s) = i — 2r + j 
D{r,s) =i-2r+j -2s. 
Then, as n —> oo, 



2s - 2, B{r,s) = 2r + 2s, C{r,s) = 2r + 2s - 2, and 



{Xi,X2, ■ ■ ■ ,Xk 



{YuY2,...,Yk 



Proof. A technical but simple calculation in the spirit of the ones performed in Section 
13.11 shows that are the moments of the asymptotic level density, via the expansion of 
tr(M^) as a sum of products of the entries of Mt and separation of the zero-order term. 
Furthermore, examining the first-order terms in this expansion yields the covariance result, 
using the same counting techniques as in Section f3. II 

Computing the first-order deviation from the mean is slightly more complicated, as 
first-order terms in the expansion come from four sources; we only enumerate these sources 
here and indicate how they play a part in the total sum. 

We begin with expressing 

tr ' ' " 1 - > 



£;[tr(M|)] = E 



Ft + —^Rt 

In 



E 



E 



E 



E 



n 



P/2 



{Ft)j {Rt)r 



where we have used the notation of Section 13.11 

It is easy to prove, like we did in Section [3.H that the zero-order terms are given by 
the the pairs (J, R) where = 0, that the terms {Rt)r which contain a single variable 
(at first power) are annihilated by the expectation (since all variables in Rt are centered), 
and that the terms where {Rt)r contains three or more variables (counting multiplicities) 
do not contribute to the first-order deviation. 

To each sequence 2 (recall that 2 = {ii, . . . ,ik) with \ij G {0, 1} for 1 < i < k — 1 

and — ii\ £ {0,1}) we associate in a one-to-one fashion, a path from (0,0) to (A;,0) 
taking steps up, down, or level (depending on the nest term being larger, smaller, or equal 
to the current one). The zero-order terms sum asymptotically to (with the integral 
being obtained from the Riemann sum, and (r^fc_2r) = l^fcD- 

Three of the first-order term sources come from those terms that have =0, while 
the fourth comes from the terms for which {Rt)r contains a single variable, at the second 
power. Note also that the terms for which {Rt)r contains two different variables will be 
annihilated by the expectation. 

Source 1. In the zero-order count, we ignore the fact that at the "edges", i.e. upper left corner, 
corresponding to ii G {1, . . . , [f J}, and lower right corner, corresponding to ii G 
{n — [|j , . . . , n}, not all paths in Vk,r can appear in the sum. This approximation 



yield a first-order term which is asymptotically equal to 

r=0 \ \i=0 ^ 



k 

r,r, k — 2r . 
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Source 2. In the zero-order count, we approximate the value of the integral ^''(x) g^^{x) dx 



by the Riemann sum - Y11=q f {~)^ ^^S'(-)^^- This yields a first-order term asymp- 
totically equal to^ 

L-J 

Source 3. Finally, in the zero-order approximation, we replace / 9 (^^) (n)' 

respectively g (^) for each j G {—k, . . . , k}. A Taylor series approximation and the 
counting of terms shows that this way we need to subtract off have a first-order term 
which is asymptotically 

L-J 

S^--=i2 E f f'^^'-\t)9^'{t)nt)dt-2jh,{P) f f'-^'{t)g^''\t)g'{t)d^ 

Source 4- The last source of first-order terms comes from the terms in which {Rt)r contains 
a single variable at the second power, and its contribution is asymptotically equal 
to 

E E('"'f^)) [ f'-'\t)g^^~\t)dt 









Finally, adding 5*1, S2, S3, and 6*4 yields the statement of Theorem 14.21 □ 

Remark 4.3. Note that when f{x) = \fxl2, g(x) = for all x G [0,1], o"^ = 2)3; ^.f^d 
Tj^ = g^, both the level density asymptotics and the covariance matrix for the fluctuations 
are the same as for the (3-Hermite ensemble of Section \3.1{ 

Crucially, the deviation is different. The reason is that in the approximation 



2 

the next order term is of order . , , which plays a part in computing the deviation. 

V("-j)/3 

We remind the reader that we computed the deviation for the f5-Hermite ensembles 
by using the palindromic property of expectations of trace, thus reducing the problem to 
computing the deviation for the "(3 = 00" case, for which we used Hermite polynomials 
properties. This allowed us to find the distribution behind the moments of the deviation. 



One can for example use the Euler-Maclauren approximation formulas to obtain this. 
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4.2 Bidiagonals 



Let / and g be two integrable funetions, /, g, : [0, 1] M, such that the integrals 
Jo f^i^) y^l^) '^^ exist for all a, 6 G N, and such that the quantities 



r=0 •'^ 



are the moments of a (uniquely determined) distribution ly defined on [a,b]. 
For any n G N, we consider the n x n matrix Fb, defined below: 



Fb 



V 



5(h) /(h) / 



in other words the diagonal of Fb has an equidiscretization of / with step 1/n (with the 
exception of the cndpoint at 0, which is missing), and the subdiagonal is an equidiscretiza- 
tion of g with step 1/n (missing both the cndpoint at and the one at 1). 
We next consider the random bidiagonal matrix 



Rb 



Vn—l ^n—l 

V 



yi xi J 



where the variables Xj, yj are mutually independent and satisfying the following proper- 
tics: 

• E[xi] = E[yj] = 0, for ah 1 < i < ra, 1 < j < n - 1, 

• Var[xi] = for all 1 < z < n and Var[yj] = rj^, for all 1 < j < n — 1, 

• for all A; there is a constant > such that |E[(a;i)'^]| < and |£;[(yj)'^]| < Mk 
for all 1 < z < n and 1 < i < n — 1. 



Finally, consider the matrix 



with 



Ml = MbMI , 



Mb = Fb + -;=Rb ■ 



(31) 



Note that while Fb is a non-random matrix, Rb-, Mb, and Ml are random. 
Wc will compute the asymptotical level density and the first-order deviation and fluc- 
tuation for the random matrix Ml- 

We need to define the following quantities. 
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Definition 4.4. Let Q he a path on the lattice 1? , starting at (0,0) and ending at {2k, 0), 
with up {{x, y) {x+l, y+1)), down {{x, y) (x+l, y—\)), and level ((x, y) (x+l, y)) 
steps. In addition, we require that the path is alternating, i.e. on each odd-numbered step 
(first, third, etc.) the path is only allowed to go down or stay at the same level, whereas 
on each even-numbered step (second, fourth, etc.), the path is allowed only to go up or 
stay at the same level. 

For each level j E Z, we define the quantities Cj{Q) and dj{Q), as follows: 

Cj{Q) := #o/ level steps from j to j; 
dj{Q) := #o/ down steps from j to j — 1 . 

Note that, since the path Q ends at {2k, 0), the number of up steps it takes must always 
equal the number of down steps it takes. 
Also let 



Qr,k 
Qr,k,i 

Qk 



{alternating paths from (0,0) to {2k, 0) with exactly r down steps}, 
{alternating paths in Qr^k which descends to, but not below, y = — «}, 

I Qr,k,i\ 1 
Ur=0 Q-r,k ■ 



Theorem 4.5. Let Ml be the matrix from the ensembles defined by (|31|) . of size n, with 
eigenvalues (Ai, . . . , A„), and let k > 1 be a positive integer. For all 1 < i < k, let 



A. = ±{mr-''9{lr{E(--^)%.r-l(ly 

r=0 \ \j=0 ^ ^ 

+ /(o)^'-"^(o)-[e(--^W-^Q 



E E^'^^(Q) [' f'-'''-Ht)9''{t)f'{t)dt - 

2 E E-?'^^(^) [ f''^''ii)9^"'\i)9{t)dt + 



Also, for for any 1 < i < k, let 



+ E E {'f^)] [f'-''~wm + 

E E('To)r^"""^'^^""^'^''- 



Xi = tr{{MiY) -n a;V(x) dx - fn , 

J a 

i r-b 

= ^] ~ ^ / 2;V(x) dx — fii 

.7 = 1 
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Let (yi,l2, . . . he a centered multivariate Gaussian with covariance matrix 



Cov{x\x^) = 4cj2 Q f^^'-''\x)g^^''''\x) dx + 



l<r <i 
1 < s < J 



< r < i — 1 
< s < i - 1 

mi/i A(r, s) = 2(r + s — 1) and B{r, s) = 2{i + j — r — s). 
Then, as n ^ n, 

{Xi,X2,...,Xk)^{Yi,Y2,...,Yk) . 

Proof. The proof of Theorem 14.51 is based on the same calculations as Theorem 14.21 The 
covariance can be computed using the same general principles as in Section [3.21 and the 
examination of the zero-order and first-order terms in the mean can be done as in Section 
14.21 Moreover, the sources of the first-order terms are the same as in Section 14.21 it is 
only the type of path we are counting that changes (from paths of length k to alternating 
paths of length 2k). □ 



Remark 4.6. Note that when f = — 1 + x, g = \fx, — — hoth the level 
density asymptotics and the covariance matrix for the fluctuations are the same as for 
the P-Laguerre ensemble of Section \3.'A 

Once again, the deviation is different, for the same reason as in Section\4.^' in the 



approximation E[^X(n~-i)i3] ~ ^ Y^(n — i)P the next order term is of order . ^ ^ , which 

plays a part in computing the deviation. 

We computed the deviation for the P-Laguerre ensembles by using the palindromic 
property of expectations of trace, thus reducing the problem to computing the deviation for 
the "P = 00" case, for which we used Laguerre polynomials properties. This allowed us to 
find the distribution behind the moments of the deviation. 



5 Histogramming eigenvalues efficiently 

We propose a very effective numerical trick for counting the number of eigenvalues in an 
interval numerically. This method does not require the computation of eigenvalues and 
requires a number of operations that is 0(n), rather than 0{n?), which allows for counts 
for matrices of a very large size. The method is the standard Sturm sequence method for 
tridiagonal symmetric matrices. We take as input D, a vector of length n, which is the 
diagonal of the matrix, and E, a vector of length n — 1, the squares of the elements on 
the super or subdiagonal. This avoids unnecessary square roots in the formation of the 
matrix which can slow down computation. 

The algorithm is remarkably simple. For a given value a, which is not an eigenvalue 
of the matrix, compute 

ti := Di — a — 
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for t = 1 through n with to = 1 and Eq = 0. 

From Sturm's theorem (see for example |17| Theorem 13]) we know that the number 
of eigenvalues of the tridiagonal matrix less than or equal to a is the number of tjS that 
are negative, with 1 < i < n. 

To histogram eigenvalues of /3-ensembles for n's into the millions or even the billions, 
one can simply compute E from a chi-square random number generator (implemented 
in MATLAB's Statistics Toolbox as chi2rnd), and D from a normal random number 
generator for the /?-Hermite case (randn) or from another chi-square in the /3-Laguerre 
case. 

An interesting special case is the /3-Hermite case with /3 = oo, which computes the 
roots of the corresponding scaled Hermite polynomial. This may be performed by taking 
D to be the zero vector of length n and E = {n — l,n — 2, . . . ,2, l)/(4n). 

When P = oo, there is no fluctuation, but there are deviations for large n. Roughly 
the theorem states that the number of eigenvalues in an interval / is linear of the form 

(# eigenvalues) = n(area under the semicircle) + DEVIATION , (32) 

where 

DEVIATION = —arcsin{x) (33) 
2tt I 

and we subtract 1/4 if I contains +1 and 1/4 if I contains -1. 

In one numerical experiment (see Figure[2I), we took n = 1, 000, OOO+i for i = 0, 1, 99 
and computed the deviation from the mean, i.e. the number of eigenvalues in the interval 
/ minus the area A = Jj \/l — x'^ dx; we did this arbitrarily for the interval I = [.2 .8]. 
Since this experiment is non-random it is repeatable without any reference to a random 
number generator. We found the experimental deviation of 0.1167 which is close to the 
theoretical deviation of .1155, given by 




20 40 60 80 100 
size of matrix, translated by 1 ,000,000 



Figure 2: Calculating the deviation in the interval I = [.2 .8] for the /? = oo Hermite 
ensemble with sizes 10^ : 1 : (10^ -|- 99); circles represent the eigenvalue count minus the 
area under the semicircle over /; the solid line is the theoretical deviation given in (|33|) 
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Figurel^lplots the results of the experiment. The red hne which contains the theoretical 
value of the intercept represents the best fit line to the data in the sense that the average 
vertical deviation is minimized. 

6 Remarks and open problems 

There are many object-counting (combinatorial) approaches to the study of traces of 
powers of random matrices; they depend on the matrix model, and on the polynomial 
whose trace is being computed. For example, the counting approach of jHl] uses full 
matrix models (all entries are non-zero variables), and traces of powers (thus using the 
monomial basis, like we have done here), and counts paths in the complete graph of size 
n. By contrast, in [201, polynomials used are the shifted Chebyshev polynomials, and 
the objects counted are non-crossing annular partitions; the matrix models are still full. 
Here, we use tri/bidiagonal matrix models, consider the monomials, and count essentially 
paths with three types of steps (up, down, level) in the plane. 

Though the objects we count here are simpler than in [22j, our counting technique 
expresses the results in a less compact form than in |21| and [^H]- In the latter two 
papers, the covariance matrix is diagonalized by the choice of polynomial basis, whereas 
in our paper it is obtained as full because we work with the monomials. There seems to 
be a trade-off between the simplicity of the object to be counted and the simplicity of the 
form in which the covariance matrix is expressed. 

We would like to conjecture that by using a hybrid way of counting, for example, 
using the tridiagonal matrices and some of the techniques of |26j . both the counting 
process and the resulting format of the answer could be simplified. The development of 
such a technique would be of great interest. 
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